DCMD: Distance-based classification using mixture distributions on microbiome data
نویسندگان
چکیده
Current advances in next-generation sequencing techniques have allowed researchers to conduct comprehensive research on the microbiome and human diseases, with recent studies identifying associations between health outcomes for a number of chronic conditions. However, data structure, characterized by sparsity skewness, presents challenges building effective classifiers. To address this, we present an innovative approach distance-based classification using mixture distributions (DCMD). The method aims improve performance community data, where predictors are composed sparse heterogeneous count data. This models inherent uncertainty counts estimating distribution sample representing each observation as distribution, conditional observed estimated mixture, which then used inputs classification. is implemented into k -means -nearest neighbours framework. We develop two distance metrics that produce optimal results. model assessed simulated study results compared against existing machine learning approaches. proposed competitive when other approaches, shows clear improvement over commonly classifiers, underscoring importance modelling achieving range applicability robustness make viable alternative source code available at https : //github . com/kshestop/DCMD academic use.
منابع مشابه
On Classification of Bivariate Distributions Based on Mutual Information
Among all measures of independence between random variables, mutual information is the only one that is based on information theory. Mutual information takes into account of all kinds of dependencies between variables, i.e., both the linear and non-linear dependencies. In this paper we have classified some well-known bivariate distributions into two classes of distributions based on their mutua...
متن کاملClassification and properties of acyclic discrete phase-type distributions based on geometric and shifted geometric distributions
Acyclic phase-type distributions form a versatile model, serving as approximations to many probability distributions in various circumstances. They exhibit special properties and characteristics that usually make their applications attractive. Compared to acyclic continuous phase-type (ACPH) distributions, acyclic discrete phase-type (ADPH) distributions and their subclasses (ADPH family) have ...
متن کاملFeature-Based Face Recognition Using Mixture-Distance
We consider the problem of feature-based face recognition in the setting where only a single example of each face is available for training. The mixture-distance technique we introduce achieves a recognition rate of 95% on a database of 685 people in which each face is represented by 30 measured distances. This is currently the best recorded recognition rate for a feature-based system applied t...
متن کاملDistance-based mixture modeling for classification via hypothetical local mapping
We propose a new approach for mixture modeling based only upon pairwise distances via the concept of hypothetical local mapping (HLM). This work is motivated by increasingly commonplace applications involving complex objects that cannot be effectively represented by tractable mathematical entities. The new modeling approach consists of two steps. A distance-based clustering algorithm is applied...
متن کاملStatistical Wavelet-based Image Denoising using Scale Mixture of Normal Distributions with Adaptive Parameter Estimation
Removing noise from images is a challenging problem in digital image processing. This paper presents an image denoising method based on a maximum a posteriori (MAP) density function estimator, which is implemented in the wavelet domain because of its energy compaction property. The performance of the MAP estimator depends on the proposed model for noise-free wavelet coefficients. Thus in the wa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: PLOS Computational Biology
سال: 2021
ISSN: ['1553-734X', '1553-7358']
DOI: https://doi.org/10.1371/journal.pcbi.1008799